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ESTIMATING STELLAR FUNDAMENTAL PARAMETERS USING PCA: 
APPLICATION TO EARLY TYPE STARS OF GES DATA 
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Abstract. This work addresses a procedure to estimate fundamental stellar parameters such as T e ff, logg, 
[Fe/H], and vsini using a dimensionality reduction technique called principal component analysis (PCA), 
applied to a large database of synthetic spectra. This technique shows promising results for inverting stellar 
parameters of observed targets from Gaia Eso Survey. 
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1 Introduction 


With the introduction of new telescopes and instruments to the scientific astronomical community, and the rapid 
increase of sky surveys such as SDSS and RAVE, tremendous amount of spectral data is being acquired on a 
daily basis and with an increasing rate. Therefore, these challenges urged the need of efficient and automated 
techniques to handle and analyze this huge amount of information. Such automated procedures for classification 
of stars have been discussed recently using different codes and mathematical approaches. As an example, one 
can mention the methods used to analyze a spectral library described in Jofre et al. (2014). 

In this work, we present a dimensionality reduction technique called PCA, applied to a huge database of 
synthetic spectra. PCA searches in a high dimensional space for possible correlations, and finds an optimal 
basis for representing the data in a compact way. Due to the high number of spectra in each synthetic database 
(^200 000), and the high number of data points in each spectral domain (^2 500. Same as observation, see 
section 2), such technique is crucial for inverting stellar parameters of observed targets from Gaia ESO survey. 
Using PCA, data can be represented in a fewer number of data points, allowing a fast “nearest neighbor(s)” 
search between the observed data set and the synthetic spectra. This study is an extension of |Paletou et al. 
(2015) where the H-R domain of application has been extrapolated to stars of types earlier than F, and the 


training database used in this work is a set of synthetic spectra. 


2 Observation 

The procedure is applied to more than 800 stars, members of the open clusters NGC3293, NGC6705, and 
Trumplerl4. The observations are part of the GAIA ESO public survey and consist of 2 spectral ranges, one 
samples the line region [4030-4200] Aand the second samples the [4400-4550] A(HR5) region. These spectra 
were taken using GIRAFFE/FLAMES spectrograph at a resolution R ~ 25 000, and reduced by the GES. 


3 Spectral range selection 

Balmer lines, due to the broadening caused by the Stark’s effect, are excellent indicators of effective temperature 
and surface gravity (Gray 2005). The reason behind studying in particular is because this line is formed 
in deep enough atmospheric layers where LTE can still be considered as a reasonable assumption. Moreover, 
the HR5 region was chosen since metallic lines (namely Fell, Mgn, Till, ...) are potentially good indicators of 
rotational velocity and metallicity. 
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4 Synthetic Spetra 


LTE model atmospheres were calculated using ATLAS9 code (Kurucz 1992) and were used as input to the 


spectrum synthesis code SYNSPEC48 ( Hubeny fe Lanz |1992 ) in order to compute a large grid of synthetic line 


profiles, over the same spectral regions as the observations. Spectra were calculated for T e ff between 5 000 and 
15 000 K, gravities between 2.0 and 5.0 cgs, rotational velocities between 0 and 200 km s _1 , and metalicities 
between -0.6 and 0.4 dex (only for the HR5 region, whereas a solar [Fe/H] was assumed for the H$ region), all 
at a microturbulence of 2 km s -1 and at a resolution of 25 000. 


5 Procedure 


The central idea of principal component analysis is to reduce the dimensionality of a data set in which there 
are a large number of interrelated variables, while retaining as much as possible of the variation present in the 
data set (J olliffe||198 6). PC A searches for basis vectors that represent most of the variance in a given database. 
These vectors (<?&) are in fact the eigenvectors of the variance-covariance matrix (5.1) of the synthetic data set 


C = (S —S) T .(S-S) 


(5.1) 


Where S being the mean spectrum over all the database. 

Once the basis is obtained (adopted a set of 12 vectors, i.e k = 1,12), the synthetic spectra and each 
observation (O) are projected unto this basis to obtain the projected coefficients (5.2 & 5.3) 


Pj,k 


= (S j - S).e fe g 


(5.2) 


Pk = (O - S).e k 


(5.3) 


Then, a standard chi-squared (5.4) is performed in this low dimensional space in order to achieve a fast inversion 


of stellar parameters of the observed targets. The parameters of the synthetic spectrum having the minimum d 
will be considered as the observation fundamental parameters. 


df ] = -Pj,kf 


(5.4) 


The observation spectra were radial velocity corrected, and those with low signal-to-noise ratio were filtered 
out. Upon starting the inversion process, the technique showed to be very sensitive to normalization of spectra, 
thus an iterative “re-” normalization procedure was performed according to Gazzano et al. (2010). 


6 Results 

In general, inversion based on this technique was performed over the selected stars, and the fundamental 
parameters of the targets were estimated. An example of the nearest neighbor search is given in figures [l] and [2] 
The parameters derived by PCA, along with the non-official parameters obtained by WG13 of GES are detailed 
in table [lj 


Table 1 . Results derived using PCA along with parameters given by WG13 of GES 



Derived 

Given 

Star 

TeS 

log g 

v sini 

[Fe/H] 

TqK 

log# 

v sini 

[Fe/H] 


(K) 

(dex) 

(km s -1 ) 

(dex) 

(K) 

(dex) 

(km s -1 ) 

(dex) 

10361733-5809031 

14400 

3.6 

45 

- 

14 775 

3.84 

44 

- 

10430337-5941536 

9 200 

4.6 

70 

0.4 

8 633 

3.5 

75 

- 


With PCA, we will be contributing by determining stellar parameters to the next GES data release. 


S j is the j th spectrum (a row vector) in the database (matrix) S. 
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Fig. 1 . Example of the fitting of the line of the star 10361733-5809031 member of NGC3293 cluster, with 
a synthetic spectrum. Blue being the observed spectrum, while red the fitted synthetic. 
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Fig. 2. Example of the fitting of the region containing Fell, Mgll and Till lines of the star 10430337-5941536 
member of Trumplerl4 cluster, with a synthetic spectrum. Blue being the observed spectrum, while red the 
fitted synthetic. 


7 Conclusions and future work 

PC A proved to be a fast and reliable inversion technique, with an ease to implement. An attempt to increase the 
size of the synthetic database is being performed in order to improve the accuracy in the parameters obtained. 
Moreover, the merging of two spectral ranges in a one data set is also considered as a future work. 

This work is based on observations collected with the FLAMES spectrograph at the VLT/UT2 telescope (Paranal Observatory, 
ESO, Chile), for the Gaia-ESO Large Public Survey, programme 188.B-3002. 
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